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Dr.  King’s  work  (’07,  ’09),  “On  Studies  on  Sex-determination 
in  Amphibians,”  suggests  an  interesting  problem  which  may  be 
put  in  the  following  form  : 

A jar  contains  a large  number  of  male  and  female  tadpoles,  the 
proportion  of  each  being  unknown  : if  on  picking  out  m -f  n tad- 
poles, m are  found  to  be  males  and  71  females,  to  find  the  prob- 
ability that  the  ratio  of  the  number  of  either  sex  to  the  entire  lot 
lies  between  given  limits. 

It  will  be  seen  that  problems  of  this  nature  occur  frequently 
in  biological  investigations  and  that  it  is  of  importance  to  have 
some  method  for  determining  the  accuracy  of  the  observed  pro- 
portions. This  can  be  done  by  means  of  the  formula  given  below. 
As  the  development  of  the  formula  is  somewhat  complicated  I 
shall  present  the  entire  process  of  the  mathematical  treatment  of 
the  solutions  based  on  the  theorem  of  Bayes,  together  with  one 
application.  Although  such  an  elementary  exposition  of  the  sub- 
ject will  be  superfluous  for  one  who  is  familiar  with  the  theory  of 
probabilities  nevertheless  for  others  it  may  be  helpful. 

If  p denote  the  probability  that  an  event  will  happen,  then 
(1  — p)  is  the  probability  that  the  event  will  fail.  If  the  proba- 
bility that  the  event  will  fail  on  any  single  trial  is  (1  —p),  the 
probability  that  it  will  fail  every  time  is  (1  —p)n.  The  proba- 
bility that  it  will  happen  on  the  first  trial  and  fail  on  the  succeed- 
ing n — I trial  is/(i  — p)n~l.  But  the  event  is  just  as  likely  to 
happen  on  the  second,  third,  etc.,  trials  as  on  the  first.  Hence 
the  probability  that  the  event  will  happen  just  once  in  the  n 
trials  is 

np{  I 

Continuing  this  process,  we  can  easily  see  that  the  probability 
that  it  will  happen  m times  in  m -f  n trials  is 
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in  + ft 

o> 


From  (i)  we  can  obtain  directly  the  probability  that  exactly  m 
males  and  //females  occur  in  in  + //^drawings  and  the  expression 
will  be 


P 


\m  + n 

| in \n 


x(l  -xf 


(2) 


where  x denotes  males  and  (i  — x)  females. 

If  we  call  the  right  hand  member  of  (2)  yt  then 


y = cxm(  1 — x)n 


and  this  equation  represents  a curve  with  a zero  ordinate  when 
x = o and  x = l . Since  the  elementary  area  under  the  curve  is 


| in  -f  11 
| m 11 


xm(i  — x)ndx, 


the  number  of  cases  which  occur  between  the  two  limits  (a,  b ) is 
proportional  to  the  sum  of  the  differentials  taken  from  a to  b , or 


I in  4-  11  rh 

— ~ = I xm(\  — x)ndx 

\tn\n  Ja  v J 


Also  the  totality  of  cases  will  be  proportional  to  the  sum  of  the 
differentials  taken  from  o to  1,  or 


m 4-  11 
I m n 


1 — x)ndx. 


Hence  the  probability  that  the  ratio  of  the  males  in  the  jar  to 
the  entire  lot  lies  between  the  two  limits  is 


P = 


£ xm(  1 — xfdx 
J"  xm(i  — x)ndx 


(3) 


This  is  the  well-known  theorem  of  Bayes.1  This  equation  as  it 

1 See  Todhunter’s  “ History  of  the  Theory  of  Probability  from  the  Time  of  Pascal 
to  that  of  Laplace,”  1865. 
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stands  is  applicable  to  cases  where  the  number  of  observations 
is  small.  For  cases  where  the  number  of  observations  is  large 
we  must  modify  this  still  further. 

If  we  suppose  the  two  limits  to  be  m/s  db  Q where  ^ = m 4-  n , 
then  equation  (3)  may  be  written  in  the  following  form  : 


P 


Jf*m/s+6 

xm(\  — x)ndx 

m/s—0 

jf*  xm(i  — x)ndx 


By  successive  integration  by  parts  the  denominator  is  evaluated, 
giving 

JC'  , . , m\n\ 

1 xm(  1 —x)ndx—  — (a) 

o v ; {m  -f  n + 1)  ! W 

If  we  let  x = m/s  4-  z the  numerator  becomes 

(*  7 in  \m  / n \n 

Jj  [j  + V (1-* )dz- 

which  is  approximately 

mmnn  r+e  -d3?2 

I £ 2mndz. 

J-e 


With  the  above  transformation  the  formula  becomes 


P = 


(7  + 1 ) ! mmnn 
m\n\  ss 


L 


+ 0 S3g2 

£ 2mndz. 

0 


(5) 


Now  if  we  apply  Stirling’s  formula  for  large  numbers  (4) 
becomes 

( j + 1 ) ! j3 

m ! n ! mmnn  \ 27:  inn 


Therefore  (5)  may  be  written  in  the  following  form  : 


If  we  assume 
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and  substitute  co  for 


e 


J 


J8 

2 mu 


then  as  a final  form  we  have 


Since  the  above  expression  is  a well-known  equation  for  the 
probability  integral,  the  degree  of  probability  for  any  given  limits 
may  be  determined  from  the  table.  For  computation  we  have 
the  following  relation  : 


= dJ  or  0 = co 

\ 2mn  X s> 


and 


p = 


As  will  be  seen  from  the  above  relations,  the  degree  of  prob- 
ability is  proportional  to  the  amount  of  deviation.  Therefore  in 
any  given  case  we  can  either  increase  or  decrease  the  value  of 
probability  by  changing  the  value  of  the  deviation.  Thus  in 
order  to  facilitate  a comparison  of  several  sets  of  data,  it  would 
be  advantageous  to  fix  the  value  of  the  probability  and  then 
determine  the  corresponding  amount  of  the  deviation.  If  we  take 
0.75  for  the  value  of  the  probability,  it  will  be  certainly  high 
enough  for  practical  purposes.  If  we  adopt  this  system,  then  the 
corresponding  value  of  co  is  fixed  and  is  equal  to  0.814.  Thus 
we  do  not  even  need  to  use  the  table  to  determine  the  amount  of 
deviation.  If  however  one  wishes  to  find  any  other  value  of  the 
probability  than  I have  proposed  (i.  e.,  0.75),  such  can  be  readily 
obtained  from  the  table.  Thus  the  determination  of  the  deviation 
can  be  made  by  a simple  arithmetical  process. 

The  following  will  illustrate  a method  of  determination.  Dr. 
King  has  kindly  supplied  the  data  for  this  purpose  and  I wish  to 
thank  her  for  that  material. 

Example:  Out  of  16, 100  tadpoles,  9,949  are  examined.  5,136 
are  found  to  be  females  and  4,813  males.  Find  the  probability 
that  the  ratio  of  females  to  the  entire  lot  lies  between  given  limits. 
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We  have 

P = (p{io)  - 0.75. 

Since  we  adopted  the  fixed  value  for  probability  (0.75)  the 
value  of  co  is  also  fixed  and  equal  to  0.814.  Thus  our  problem 
is  therefore  to  find  a value  of  d when  the  value  of  p is  known. 

0 = 0.8 14  I2  x _ 0.814  x 0.00709  = 0.0058. 

\ 99,493 

Deviation  = 16,100  x 0.0058  = ± 93. 

Total  number  of  females  = 16, 100  x 5,136/9,949  = 8,311 

=*=  93- 

With  this  value  of  probability  we  conclude  that  the  total 
number  of  females  in  the  entire  lot  is  neither  greater  than  8,404 
nor  less  than  8,218. 


co 


J 


2 mn 


